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DUAL MODE DATA COMPRESSION TECHNIQUE 



RELATED APPLICATIONS 



[00011_ This application is related to U.S. application Ser. No. 



t09/750.188. ( Attorney 



Docket: 3175-51), entitled -"Enhanced Data Compression Technique^" and filed concurrently 
herewith on Dec. 29, 2000. 



[00021 The present application relates generally to data compression and more particularly to an 
enhanced data compression techniqu e. This technique is particularly suitable for use in the 
graphical arts for compressing large images. 



[00031 In the graphic arts there is a tendency to have extremely large, one-bit-per-sample images 
approaching or even exceeding 2 gigabytes of data. The need to compress such data has been 
well known for many years. 

[00041 One proposed technique for compressing such data is commonly referred to as a pack bit 
IPackBits. hereinafter "P B)," compression technique. Using propose d The PB compression 
techniques. technique produces either a string of characters-is preceded with a count and a repeat 
character code o r, alternatively, a single byte pattern is-preceded with a count. Propos o d The PB 
compression techniques areis capable of processing data very quickly. These techniques This 
technique also provides satisfactory results i%hen the data is eithe ra string of solid black or solid 
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respectively . Accordingly, the PB techniques provides reasonably satisfactory results for non- 
color image data. 

f00051_ An exemplary pack bits PackBits representation of a stream of sequential input data, as it 
would appear entering a processor prior to encoding, might include the string of characters 
-"abcOOOOOOOOOO-". Using the PB technique, the processor would first determined whether or 
nef-fe edetermine if a first character ^"a-" and thea second character ""b " matc h " are the same 
character . Underf n some proposed PB techniques, the processor might scan ahead to consider 
other matches in certain of the subsequent characters when determining if a stream contains the 
same repeated character . In any event, s ince in the present example^ the determination 
^comparison that determines if "a" and "b" are the same character returns a negative^ th eresult. 
The processo r then proceeds to encode the input data as a literal string with a length. Next Tthe 
processor nex^determines whether or noti f the second character ^"b-" and the third character 
-"c " match " are the same character . Since this determination is also negative, the processor will 
proceed to encode the three characters of the input data as a literal string with a length. The 
processor newthen determines whether or noti f the third character ""c-" and the fourth character 
-"0 " matc h " are the same character . Since this determination is also negative, the processor will 
proceed to encode the four characters of the input data as a literal string with a length. The 
processor continues by det e rmining whether or no t and determines if the fourth character ^"0-" 
and the fifth character -"0 " matc h "are the same character . Since this determination is positive, 
the processor continues by repeatedly determining whether or noti f the immediately subsequent 
characters in the sequence are also mateh rthe same character until it makes a negative 
determination. The processor thereby determines the repeat count for the character ^"0-". Based 
on the initial positive determination, the processor also proceeds to encode the first three 
characters of the input data sequence, i.e. ^a-,"13"" and ""c"," as a literal string with a length 
and the following 10 characters of the input data sequence, i.e. the -"0^_. . . ^^y V'O," as a repeat 
character with a count.- 



[00061_ Accordingly, the processor generates encoded output data forming a 2-byte sequence 
including the strings of characters ^"82abc^" and ^"090^\ In the output data, the -"8"!! serves as 
a header and i ndi c ates indicating that the total length of the sequence is 8 bits and that a literal 
string followsy-^tThe— _!!2-" indicates that the length of the literal string is three characters, i.e. 
characters ^z-Z ^Z and ^"c^!\_ tjhe first indicates that a repeat character follows, and 
the ^9^" indicates that the repeat character rep resen ted by A the second ^"0," is repeated 10 
times. Using one-off numbers such as the "2" to indicate a literal string of 3 characters, and the 
"9" to indicate that a repeat character is repeated 10 times. Using one-off numbers such as the 
"2" to indicate a literal string of 3 characters, and the "9" to indicate that a repeat character is 
repeated 10 times, is efficient because 129 bytes can be packed using number up to 128. 

[00071_ To decode the encoded sequence - '82abc 090". 090," the receiving processor first reads 
the new-header Ii 8 i y'8," which is the highest order bit , and from . From the heade r, the processor 
determines that a literal string follows. The processor then extracts the string length, ^"2." and 
reads the next three characters ^"a^," ^"b-" and ^"c^". At this stage, the output string is "abc" 
and the remaining input string is "090." The processor-^iex t then reads the first ""0 " and from 
this determin e s " which indicates that a repeat character follows. The processor continues by 
extracting the repeat count, ^"9," and readin gthen reads the next character "0", which is "0," the 
character to be repeated 10 times. It will be recogniz e d that by u s ing one off numbers such as the 
"2" to indicate a literal The resulting decoded string of 3 charact e rs an d is "abcOOOOOOOOOO." is 
the ^string originally presented prior to indicat e that a repeat character is repeated 10 times, a 
clos e to 1% improv e m e nt i s obtainabl e be c ause 1 28 byt e s can b e packed into 129. encoding. 

[00081, As should be clear from the above, the P B techniques processes only one character at a 
time. Accordingly, PB techniques ar e i s incapable of compressing strings of repeating multiple z 
byte patterns of characters . The PB techniques also haves a relatively limited compression rate, 
generally no more than 64 to 1. Thus, the PB compression techniques provides unsatisfactory 
results when used to compress color image dat a, which typically contains repeating multi-byte 
patterns of characters instead of repeating single-byte 0s and Is . 



[00091 _ Another proposed technique for compressing image data is commonly referred to as the 
Lempel-Ziv-Welch 4, hereinafter " LZW)." compression technique. Using proposed the LZW 
compression techniques technique. variable length e^strings of byte based data can be processed. 
Proposed The LZW compression technique^ processes the data somewhat slower than the PB 
compression techniques, but provides satisfactory results on data representing both color images 
as w e ll as and black z and -white images. However, since these techniques are based on single 
bytes of data, such techniques are incapable of compressing data on an arbitrary pixel or bit 
boundary basis. Additionally, although such techniques, ar e though LZW is capable of providing 
a higher compression rate than PB . LZW's compression techniques, LZW techniquesi ateis still 
offer a somewhat limited compression rate . 

An e x e mplary LZW r e pre s entation of a stream of sequential input data, as it would app e ar 
e nt e ring a proc e ssor prior to e ncoding, might include the string of charact e rs "abcOlcOl". [00101_ 
Using the LZW technique, the encoding and decoding processors must coordinate en-the 
transmission and receipt of codes. The LZW techniques uses a compression dictionary 
containing some limited number of compression codes defined during the processing of the input 
data. The characters in the input string are read on a character by character basis to determine if 
a sub-string of characters match a compression code defined during the processing of prior 
characters in the input string. If s ea pattern match is found , the matching sub-string of characters 
areis encoded with the applicable compression code. If a sub-string of characters does not match 
a pre-existing code, a new code corresponding to the sub-string is added to the dictionary. Sub- 
strings are initially defined by codes having 9 bits or digits , but the number of bits may be 
increased up to 12 bits to add new codes. Once the 12 -bit limit is exceeded, the dictionary is 
reset and subsequent codes are again defined initially with 9 bits. In conventional 
implementations of the LZW techniques, two codes are predefined, i.e. defined prior to initiating 
processing of the input string. In the present example these codes are the code 1 00, representing a 
"reset," and the code 101, representing an "end. !ljn the present example, the codes 102, 103, 
and 10 4 104, etc. represent strings of new patterns that are identified during the processing of the 
input data. 



[00111 An exemplary LZW representation of a stream of sequential input data, as it would 
a ppear entering a processor prior to encoding, might include the string of characters "abcOlcOl". 
Using the LZW technique, the encoding processor would first reads the -"a-" in the sequence and 
the -"b-" immediately thereafte r in the sequence . The processor then determines if a code exists 
for the character sequence -"ab-". Since, in this example, no such code exists at this point in the 
processing, a new code 103 is generated to represent the new pattern string ""ab - ' and is added 
to the existing code dictionary . The processor continues by reading the -"c-" immediately 
following the -"b-" in the sequence. The processor determines if a code exists for the character 
sequence -"be-". Since, in this example, no such code exists at this point in the processing, a 
new code 104 is generated to represent the new pattern string -"be -" and the code is added to the 
code dictionary . 

[00121_ The processor continues byand readings the -"0-" immediately following the -"c-" in the 
sequence. The processor determines if a code exists for the character sequence ""c0"". Since, in 
this example, no such code exists at this point in the processing, a new code 105 is generated to 
represent the new pattern string - 'c0 -" and the code is added to the code dictionary . The 
processor continues further by readin g and reads the — "1— " immediately following the ""0-" in the 
sequence. The processor determines if a code exists for the character sequence -"01-". Since, in 
this example, no such code exist at this point in the processing, a new code 106 is generated to 
represent the new pattern string - '01 -"and the code is added to the dictionary . The processor 
proceeds byand readings the ^"c-" immediately following the -"1-" in the sequence. The 
processor determines if a code exists for the character sequence -"lc-". Since, in this example, 
no such code exists at this point in the processing, a new code 107 is generated to represent the 
new pattern string ""lc" " and the new code is added to the dictionary . 

f00131_ The processor proceeds by reading the -"0"" immediately following the second -"c"" in 
the sequence. The processor determines if a code exists for the character sequence -"cO 11 ". In this 
example, such a code, i.e. code 105, does exist. The processor therefore proc ee d s by r e ading s 
determines if a longer pattern match can be made, and reads the ""1 "" immediately following the 
second ""c0"" in the sequence. The processor determines if a code exists for the character 
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sequence -"cOl-". Since, in this example, such a code does not exist, a new code 108 is 
generated to represent the new pattern string -"c&^OL" which ca n also be represented as 
-"1051-!!. The processor ultimately generates encoded output data forming a that forms the 
sequence including the string of charactersi -"100abc01c l051". The sequence, broken down into 
symbols represents: a reset (100): a literal string (abcOlc): a previously found pattern ( 1 05 -); and 
a literal (1) . 

[0014] J Using the LZW technique, the encoding processor builds a tree of codes generated using 
other codes. This is a primary reason why the LZW techniques provides satisfactory results even 
though processing is performed on a byte by byte basis to find repeating bytes rbyte patterns. That 
is, the downstream encoding builds on the upstream encoding. However, using the LZW 
technique, the encoding processor can take significant processing time to encode large sequences. 
For example, if there is a large , say a megabyte, occurrence of adjacent 0 ! s or l ! s, a significant 
period of time will be required by the processor to encode the sequence. 

[00151_ The decoding processor builds a similar tree from the codes received from the encoding 
processor. Basically, the decoding processor performs the reciprocal of the encoding process to 
decode the encoded sequence characters -"100abc01 1051-". 

[00161_ In summary, the PB compression technique is deficient in that it addresses only single 
byte repeats and is limited to a 64 to 1 compression rate. Therefore, it is not suitable for color 
images. On the other hand, while the LZW compression technique addresses multi-byte repeats 
and has a compression rate of perhaps 500 to 1, butit requires significant processing time to build 
the codes whic h that are required to obtain good compression. Hence, although the LZW 
technique may be suitable wher efor encoding relatively small amounts of dat a ar e involved , 
wher e the w hen encoding-ef gigabytes of dat a is required , such as with an 80 inchTkmesr x 50 
inch image having 2400 dots per inch, the processing time and/or resources necessary to encode 
dat a using the LZW technique make using the LZW techniqu e alone impractical. 



[001 71_ Accordingly, athe need exists for a technique which can quickly compress large amounts 
of image data, offer a still higher compression rate than previously proposed techniques, and 
provide satisfactory results when used to compress either color or non-color image data. 

OBJECTIVES OBJECTIVE SUMMARY OF THE INVENTION 



[00181_ It is an object of the present the invention to provid e provides a technique for quickly 
compressing large amounts of image data. 

[00191 It is a further object of the present the invention to provide provides a technique which 
facilitates high compression rates for eithe fboth color e fand non-color image data. 

It is vet another object of the pr e sen t [00201^Ihe invention to provide provides a technique 
whic h that gives satisfactory results when used to compress eitherboth color of and non-color 
image dat a. Additional objects, advantages, novel feature s of the present invention will b e come 
apparent to those s killed in the art from thi s disclosure, including the following detailed 
description, as well a s by practice of the invention. Whil e the invention is described below with 
r e ference to preferred embodiment(s), it should be understood that th e invention i s not limit e d 
thereto. Those of ordinary skill in the art having access to th e t e achings h e rein will recognize 
additional implementations, modifications, and embodiments, as well a s other fields of u se , 
which aro within the scope of the invention as disclosed and claimed herein and with respect to 
which the invention could be of significant utility . 

SUMMARY DISCLOSURE OF THE INVENTION 

[00211_ According to one embodiment of the-present invention, an encoder for compressing 
image information includes comprises a memory and aprocessor. The memory is configured to 
store a sequence of characters representing an image. The processor is configured to determine if 
the stored sequence of characters corresponds to either a banded image, such as a segment or 



slice across the entire image, or a page image, such as one of multiple separate images making up 
the entire image. The proc e ssor op e rates in a first mode to encod e the stored s equence of 
charact e rs, if jf the stored sequence of characters is determined to correspond to thea banded 
image^ Tthe processor operates in a second mode, different than the first mode^ to encode the 
stored sequence of characters, if _Jf the stored sequence of characters is determined to correspond 
to thea page image , the processor operates in a second mode to encode the stored sequence of 
characters . 



[00221_ Preferably, when the processor e ncodes the stor e d sequence of characters in accordance 
with a pack bit compression t e chnique is operating in the first mode of operatio n . PackBits 
compression is used, and when the processor is operating in accordance with a n the second mode. 
LZW compression t e chnique is used by default. However, if while in the second mode-ef- 
operation. Beneficially , the processor is also configured to encode the stored sequence of 
character s in accordance with a pack bi t determines that PackBits compression t e chnique in is_ 
appropriate, e.g. when presented with a string of repeating Os or Is. the second mod e of 
operation. processor may switch to using PackBits as the compression technique. Making this 
can b e beneficial in som e cases transition between compression techniques does not change the 
mode from second to first . The mode remains the same, only which compression technique is 
used is altered. 



[00231_ Advantageously, if the stored sequence during operation of characters is det e rmined to 
correspond to the pag esecond imagemode, the processor i scan be further configured to determine 
if the stored sequence of characters corresponds to a primarily white page image or a primarily 
black page imag e. For example , which that might be the case for a template typ epage. if the page 
is primarily white or primarily black. The processor encodes the stored sequence of characters 
usin g a PackBits compression technique. If the page imager-Ifs o is neither primarily black nor 
primarily white , the processor encodes the stored sequence of characters in accordance with a- 
first compression technique, e.g. a pack bit compression techniqu e , while operating in the-seeond- 
modc of operation. If not, th e processor encodes the stored first sequence of characters in 
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accordance with a second compression technique, different than the first compression technique, 
e.g. an LZW compression technique , while operating in the second mode of operation . 

[00241 _ In one practical implementation, an imaging system may include a raster image processor 
which determines if a sequence of characters corresponds to a banded image or a page image.- 
Th e If the sequence of characters is determined to correspond to a banded image, the raster 
image processor then operates in athe first mode to encode the sequence of characters4fLJf the 
sequence of characters is determined to correspond to thea band e dp age image, and to the 
processor operates in a second mode, different than the first mode, to encode the sequence of 
characters if th e s e quence of charact e rs i s determined to correspond to the page image . 

[00251 „ An imager controller receives the encoded sequence of characters. The imager controller 
then operates in either athe first mode o r the second mode to decode the received encoded 
sequence of characters back into the unencoded sequence of characters. More particularly, the- 
con t r o ller operates in a first mod e if the encoded sequence of characters corresponds to thea 
banded image, an dthe controller operates in athe secon d first mode- tf. If the encoded sequence of 
characters corresponds to thea page image , the controller operates in the second mode . 

[0026[_ Preferably, in the first mode of operation, the raster image processor encodes the 
sequence of characters in accordance wit h using a pack bi t PackBits compression technique-4 n. In 
the firstsecond mode of operation, and in accordanc e with a n the raster image processor uses the 
LZW compression technique in the second mode of operatio n by default . Beneficially, the raster 
image processor is also capable of encoding the sequence of characters in accordanc e with using a 
pack bi t PackBits compression technique in the second mode of operatio n if appropriate . 

[0027[_I n accordance with other aspects of the inventio n embodiments. while operating in the 
second mode , if the^rst sequence of characters is determined to correspond to thea page image, 
the raster image processorthen determines if the sequence of characters corresponds to aan_ 
image that is primarily white page image or a-primarily black page image . If so, the raster image 
processor encodes the sequence of characters in accordanc e with using a first compression 
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technique, for example, such as a pack - bit PackBits technique , whil e operating in th e second 
mode of operation . If ne tthe image is neither primarily black nor primarily white , the raster 
image processor encodes the sequence of characters in accordanc e with using a second 
compression technique which is different than , for example, the first compression technique, 
such as a LZW technique , while operating in the second mode of operation . 

BRIEF DESCRIPTION OF THE DRAWINGS 

[00281 _ FIG. 1 depicts an exemplary simplifiesd depiction of an image processing system in 
accordance with a first embodiment of the present invention. 

[0Q291_ FIG. 2 depicts an exemplary simplifiesd depiction of an image processing system in 
accordance with a second embodiment of the present invention. 

[00301 F IG. 3 depicts an exemplary code dictionary in accordance with the second embodiment 
of the present invention. 

[00311 FIG. 4 and FIG. 5 are flowcharts of exemplary embodiments of the invention. 

BEST MODE FOR CARRYING OUT DETAILED DESCRIPTION OF THE INVENTION 

[00321_ In pre-press imaging, particularly for-the flats having an entire plate worth of image 
information, most of the data is often either solid black or solid white, and henc e digitally 
represented in binary form by al ia stream of repeating 1-s or 0 ! S t, respectively. For halftone 
images all of the data is black and white e.. Ts and O's. 

FIGr-i0033jnCLl is a somewhat simplified, exemplary depiction of a image processing system 
1000 according to athe first embodiment of the present invention. The system 1000 includes a 
raster image processor- ^ hereinafter " RIP)," 1050, whie han imager controller 1 100. and an 
imager 1150. The raster processor includes a processor 1050a and a memory 1050b for storing 
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processing instructions and other dat a as r e quir e d . The RIP 1050 receives image data and 
converts the image data into encoded data. The mag eencoded data is then transmitted to anthe 
imager control processo r controller 1 100, which includes a processor 1000 1100 a and a memory 
1 100b for storing processing instructions and other dat a as required . Th e imager controller 1 100 
generates imager control signals to the imager 1 150 in accordance wit hb ased on the data received 
from the RIP 1050. to l050. The imager controller 1 100 sends the imager control signals to the 
imager 1 150. Mor e particularl y Specifically , the control signals from the processor 1 100a 
control the operation of the imager scanning assembly 1 150a so as to form the image on a 
medium 1 150c, such as A a film or plate, supported within the imager 1 150. As shown, the imager 
includes 1150 uses a cylindrical drum 1 150b ferto supporting the medium 1 150c . Alternatively , 
bu tthe imager 1150 could alt e rnativ e ly include use a flat bed or external drum for supporting the 
medium. 

[00341_ In a first mode of operation, which will hereafter be referred to as a flat banding "banded 
mode," the RIP 1050 receives an 80 inchrfcimesr x 50 inch color separated image having 2400 
dots per inch. Preferabl y The image could, for example, correspond to multiple pages of a 
magazine. In such a case, using imposition software on a front end preprocessor (not shown), the 
image could , for example, correspond to multiple pages of a magazine. In such a case, the image 
is be formatted such that the image printed from the imaged medium 1 150c is positioned so as to 
facilitate cutting, foldings and stitching to create multiple properly printed and positioned 
magazine pages. In any e vent, th e The RIP 1050 converts the entire image into multiple 
gigabytes of data encoded data as a single job. 

[00351_ However, due to processing power limitations of RIP 1050, the entire image cannot be 
converted into encoded data in a single operational process. Accordingl y lnstead . the RIP 1050 
slices the image is s liced into bands ? prior to the data b eing converted , typically by the^ JTie RIP 
1050. If converted by the RIP 1050, this processor 1050a may perform the b anding of the image- 
may be performed by the RIP proc e ssor 1050, OL a. Howev e r, conversion could also be preformed 
out s id e the RIP, for e xampl e by a preprocessor (not shown) could also perform the banding of 
the image . In the preferred embodiment, the RIP processor 1050a conv e rts encodes the image 
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data representing m each of the bands into encoded data in a separate operational process. Thus, 
the job of encoding all the image bands, and therefore the entire image, is completed only after 
the RIP 1050 performs multiple, separate operational processes are performed by the RIP 1050 
so as to convert all of the image data representing th e band s for th e entire image into encoded 
data. In practice, the larger the image A the smaller is-each image bandjs, with all bands 
preferably being equal in size. Furthermore, the larger the image^ the greater the startup time 
required before b e ginnin g encoding can begin. This limitation is caused by the conversion of the 
image data to encod e d data, because the larger the image, the mor e increased pre- 
conversio n encoding processing require d for larger images . Additionally, the more objects 
included in the image, the more memory that is required. 

[00361_ In athe second mode of operation, sometimes referr e d to as a hereinafter "p age assembly 
mode," the RIP 1050 receives, as multiple smaller images, an 80 inch4ime5r x 50 inch color 
image having 2400 dots per inch. In this case, one of the multiple images , whic h is primarily 
white. This image might be characterized as a template image , includ e s and include information 
such as registration marks, color gradients, and identification marks , but is primarily white . Each 
ef-th e The other of the multiple images could, for example, be the images for pages of a 
magazine, each image being a separate page of a magazin e. Here, the RIP 1050 maybe operated 
to conv e rt encode the image data representing the e ntire template image into e ncod e d data as one 
job and to convert encode all of image the data repr e senting of the other of the multipl e images into 
imag e data in as another job. When fully converte db oth jobs are complete , the multipl e 
imag e s entirety of the image will be repre s ented by encoded data . 

More particularly, i n f 00371_ [n the page assembly mode, the image is divided into page 
assemblies-^eOne of whie hthe pages is a primarily white template image whie hthat is. primarily 
white, and that is typically processed by th e RIP proc e ssor 1050a without being split into bands. 
The other of the multiple images are pages . however, are typically sliced into bands prior to being 
encoded by the RIP 1050. Because the area of each of the other multiple page images is much 
smaller than the area of the entire image discus s ed with r e ference to usedin the first mode of 
operation, fewer bands are required and, as . As a whole, it will take less time to conver t encode 
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the image data representing the multiple images into encoded data in the page assembly mode 
than the time required to conver t encode the image data representingjhe entire image into image 
datarin the bandmged mode discussed above. Thus, the RIP processor 1050a converts encodes 
the image data for each of the bands A ferin each of the oth e r multipl e images into encoded 
da tenon-template pages, in a separate operational process. The job, or jobs if the template image 
is pre-processed, is completed only after the multipl e op e rational processes are performed to 
convert all of the image data repr e senting the multiple images formin g pages. which together 
represent the entire image,, inteare encoded-data. 

[0038L Although, the image discussed with referenc e to in the page assembly mode description 
may be the same a&4he-image discussed with reference to in the prior page assemblyb anded mode. 
description , conversio n encoding in the page assembly mode will typically result in e ven a greater 
amount of encoded data than the conversio n encoding in the previously discussed banding mode. 
For example, the RIP 1050 may generate two gigabytes of encoded data may be generated by th e 
RIP 1050 to r e present the imag e in the bandinged mode, whit eyet generate three gigabytes of 
imageencoded data could be generat e d by the RIP 1050 to represent for the same image in the 
page assembly mode because ther e would b e . The discrepancy is due to the page assembly mode 
retaining more uncompressed data. Further, whether th e banding or page assembly mod e s are 
utilized by th e RIP 1050, the entir e imag e cannot b e conv e rt e d into encod e d data in a singl e 
op e rational process due to processing pow e r limitation s of the RIP 1050. 

[00391_ In the bandmged mode, the image bands may be satisfactorily converted using athe 
LZW PB technique. In the page assembly mode, a template image , of say 16 megabytes, may be 
satisfactorily converted using a PB technique since it is primarily white or primarily black and so 
is made up of mainly a repeating stream of 0s and Is. respectively . However, the PB technique 
will often produce unsatisfactory results t fwhen used to convert the bands of the other of the 
multiple images. Accordingly, in the page assembly mode, these bands are converted using an 
LZW technique. Thus, in the page assembly mode, different compression techniques are utilized 
for a single image and perhaps even in a single job. 
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Accordingly, in the firs t f00401 Referring to FIG. 4. accordingly, in one embodiment of the 
present invention, the RIP 1050 is selectively operabl e can operate in either the bandmged or the 
page assembly mode operation . Hence, in operation, the_ The RIP 1050 initially scans the 
received image data representing the imag e , or image bands if the bands ar e sliced during pre 
proc e ssing, to determine if bandinged mode or page assembly mode operations isare 
required appropriate . Alternatively, the image may be sliced into bands during pre-processing. 
(STEP 3000). If it is dctcrminc d the RIP 1050 determines (STEP 3010^ that bandiaged mode 
operation is required, the RIP 1050 implements an LZW is a ppropriate, it encodes the image data 
using the PB technique to convert the image data into encod e d data. (STEP 3020). I f. on th e 
oth e r hand, it is d e termined , however, the REP 1050 determines that page assembly mode 
op e ration is r e quir e d. is appropriate, it uses a different technique (STEP 30301 Referring to 
FIG. 5. the RIP 1050 further determines if the page image data represents a template image or 
banded imag e (STEP 3050) . If it i s determined that the page image data represents a template 
image, which as described is likely to be primarily black or white, the RIP 1050 implements uses 
athe PB technique to conv e rt encode the template image into e ncoded dat a (STEP 3020) . If, 
however, it is det e rmin e d that the page image data represents a banded image, the RIP 1050 
implements uses athe LZW technique to eonver tencode the banded image data into encoded 
dat afSTEP 3060) . The selective operation of the RIP 1050, depending o n in response to the 
receive d type of image dat a received, facilitates thea more efficient and effective processing of 
diff e rent type s of large images than has be e n was previously obtainabled in conventional RIPs. 

According to [004 11 In a second embodiment of the present invention, a sencoding can be 
interrupted and a more efficient compression technique may be applied. The invention chooses 
to interrupt the processing if the stream contains a section of all black or all white data. As a 
stream of sequential data is processed prior to encodings i f, at the start of the sequ e nc e , the 
immediately preceding character, which i shas yet to be encoded, matches the next character in 
the stream^ and this next character is either solid black or solid white fe.g. . and hence digitally 
repr e s e nted in binary form b y a stream of all V-sX o r solid white (e.g.. a stream of all 0 ! s), 
encoding is interrupted. During the interruption, a determination is made as to whether the 
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invention determines if one or more characters, immediately following the next character in the 
sequence, also match the next character. 

The secon d [00421 Another embodiment of the invention will now be described with reference to 
FIG. 2. As shown, FIG. 2 represents a somewhat simplified, exemplary depiction of an image 
processing system 2000. The system 2000 includes comprises a raster image processor-^, 
hereinafter " RJPV 2050, whie han imager controller 2100, and an imager 1 150. The RIP 2050 
receives an image and converts encodes the image into encoded data. The encoded data is then 
transmitted to imager controller 2100, which generates imager control signals to the imager 1150 
in accordance with th e e ncoded b ased on decoded data received from the RIP 2050 to control 
th e2050. The imager 1 150 after d e coding th e receiv e d datai nFIG. This imager 1150 2 is 
identical to the imager 1 150 of FIG. 1. More particularl y Specifically . the control signals from the 
imager controller processor 2100a control the operation of the imager scanning assembly 1 150a 
so as to form the image on a medium 1 1 50c ^. The whie hmedium could be identical to the 
medium 1 1 50c in FIG. 1 . The medium 1 1 50c is supported within the imager 1 1 50 of FIG. 2. As 
shown, the imager 1 150 includes a cylindrical drum 1 150b for supporting the medium 1 150c. 

[00431 In the second this embodiment of the present invention, the RIP processor 2050a 
implements a compression technique, which will hereafter be referred to as the "AGFA 
technique." The AGFA compr ess ion technique . Using the AGFA compression technique, 
variabl e length o f can process strings of byte based data can b e proc e ssed. Processing using the 
AGFA t e ch n i q u e w i ll be of variable length. Using the AGFA technique is substantially faster 
then the LZW compression technique for many large image applications t ha n t h e LZW 
compression techniqu e s , while still providing satisfactory results for both color images as well as 
those which that are primarily black -and -white. Furthe rmore , the AGFA technique is not limited 
to single bytes of data, and is therefore capable of compressing data on an arbitrary pixel or bit 
boundary basis. Additionally, the AGFA technique is capable of providing a higher compression 
rate than bot heither the PB an dor the LZW compression techniques implemented separately . 
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[00441_ An exemplary representation of a stream of sequential input data as it would appear 
entering a RIP processor 2050a prior to encoding could, for example, include the string of 
characters ^"abcO . . . OlcOl 11 " The string ^"0 . . . 0^" is a large string of z e ro's, for example 
r e pres e nting image information for 32 k pixels zeros. 

[00451J JsinR the AGFA technique, the encoding and decoding processors, i.e. the RIP processor 
2050a and imager controller processor 2100a, must coordinate en-the transmission and receipt of 
codes, similar to the coordination required by the LZW techniques. However, as will be 
described-further below, the AGFA technique uses a compression dictionary containing four pre- 
defined compression codes. The characters in the input string are scanned to determine if a 
scanned sub-string of characters match certain of these pre-defined compression codes. If so, the 
matching sub-string of characters is encoded with the applicable pre-defined compression code. 
If a sub-string of characters does not match a pre-existing code, anew codes corresponding to the 
sub-strings afeis added to the dictionary. 

Furthe r [0Q461 Furthermore , the AGFA technique provides a look- ahead function , in which to 
determine whether or not . The look-ahead function determines if the sub-string is greater than a 
minimum number, preferably 6^6 bytes, and if se-the sub-string is encoded with a new code r 
which includes ; the new code comprising any applicable pre-existing code^ and the length of the 
code field. The lengt h of the code field is the width of the pre-existing code, with thise 
eed elength forming the most significant bits and serving as a continuation indicato r, and any^ _ 
Any new coding, with this codin g code follows the length: the new code forming the least 
significant bits. Like the LZW techniques, sub-strings are initially defined by codes having 9 bits- 
or digits , but may be increased to up to 12 bits to add new codes. Once the 12 -bit limit is 
exceeded, the dictionary is reset and subsequent codes are again defined initially with 9 bits. 

[00471_ Referring to FIG. 3, in the AGFA technique, four codes are predefined and stored in the 
code dictionary 3QQO-e nl300 in the RIPls memory 2050b as codes 1330. In the present example 
these codes arei the code 100, representing a sub-string of all z e ro byt es zeroes. which 
corresponds to solid white r the code 101, representing a sub-string of all one bytes ones. which 
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corresponds to solid b lacky; the code 102, representing a resets and the code 103, representing 
aathe end of the compressed encoded data. In the present example, codes 104, 105, and 106 106. 
etc. represent sub-strings of new patterns which are generated during the processing of the input 
datea and also stored e nin the RIPls memory 2050b in the code dictionary 3000. 1300. It will be 
recognized that because codes for the strings corresponding to white and black are partially 
predefined , reduc e d proc e ssing is r e quir e d to g e n e rat e these cod es , since the . Since predefined 
codes can simply be read by the RIP processor 2050a from the dictionar y code s as . reduced 
processing is require d to generate these codes . 

[00481_ Using the AGFA technique, the RIP processor 2050a first sets a reset code 102 (read 
from the code dictionary 50001300} and reads the ^a^" in the sequence and the ^"b^" 
immediately thereafte r in th e sequenc e. The RIP processor 2050a then determines from the code 
dictionary 3000. 1300. if a code exists for the character sequence -"ab-". Since, in this example, 
no such code exists at this point in the processing, a new code 105 is generated to represent the 
new pattern string -"ab-" and the new code is stored in the code dictionary 3000 1300 enin 
memory 2050b. The RIP processor 2050a continues b yand readings the -"c-" immediately 
following the -"b-" in the sequence. The RIP processor 2050ba determines if a code exists for 
the character sequence -"be-". Since, in this example, no such code exist at this point in the 
processing, a new code 106 is generated to represent the new pattern string -"be"" an d the new 
code is stored in th^dictionary 3000. 1300. } 

100491 The RIP 2050 continues byand readings the ^"0^" immediately following the ""c-" in the 
sequence. The RIP processor 2050a determines if a code exists for the character sequence 
-"c0-". Since, in this example, no such code exists at this point in the processing, a new code 
107 is generated to represent the new pattern string -"c0-" and the new code is stored in the. 
dictionary 3000. 1300. Also, because the -"0-" is recognized as a special character , the RIP 
processor 205 0a 7 automatically scans ahead to read the next character in the sequence to 
determine if it matches with-the initial ^"0-!\_ inlf the sequence. I f hext character is not T _a_ 
matching "0." the scanning ahead is immediately discontinued and the RIP processor 2050a 
proceeds with normal processing. If se rthe next character is a matching "0." the scanning ahead 
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continues on-a* character by character-basis* until no match with " m atching " 0"" is found^-at- 
which . At that point* the scanning ahead is discontinued and normal processing continues. 

[00501 _ In this exemplary application of the AGFA technique, the RIP processor 2050a scans 
ahead and counts the number of -repeated " 0-" or — "1— " bytes in the sequence. Preferably, a 
compression threshold is pre-established and stored enin the RIP memory 2050b. For example, 
the threshold might correspond to a 4 to 1 compression rate. If such a threshold is utilized* and 
the number of " repeated " 0"" or ""1-" bytes counted is less than the number required to meet or 
exceed the threshold, e.g. if the sequence consists of only one or two zeros or ones, then a new 
code would be established for the sequence in the normal manner. Only if the number of 
" repeated " 0- ' or ^"1^" bytes counted meets or exceeds the threshold^ is the sequence encoded 
using the applicable pre-defined code 100 or 101. 

[00511_ Assuming in the present example that the number of ""0"" bytes counted by the RIP 
processor 2050a meets or exceeds the threshold, the bits in the " count is determined to 
b eposition" represent a repeat count. Either 9, 10, 1 1, or 12 bits can be used to code the repeat 
count. However, if the count is so great that more than 1 1 bits would be required for the 
encoding, a continue code which may be generated by the processor 2050a or retrieved from 
memory 2050b, is inserted as the least significant bit in the output code to enable . This continue 
code enables the output codes representing the entire sequence of zeros or ones to be strung 
together. Accordingl y Therefore . no matter how long the sequencejs, the low or leasst significant 
bit of each output code within the string of output codes would represent either an end code, or a 
continuation of the coding. Hence, 1 bit is sacrificed for the end/continuation bit leaving 8, 9, 10, 
or 1 1 bits for the repeat count. 

[00521_ Accordingly, in the present example* the output code for the repeat count of ""0-" 
characters would be formed wit husing the code ""100 11 " to indicate that this is a sequence of 
""0"" characters, followed by representing a first portion of the repeat count, and 

""001—" indicating that the output codes for the repeat count continues. Thus, the first code in 
the string of repeat count output codes would be ""100102001"". The second code in the string 
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of repeat count output codes could be " 102001". "! 02001 " with the ^102^ representing a 
second portion of the repeat count, and ""001 "" indicating that the output codes for the repeat 
count continues. The last code in the string of repeat count output codes could be -"020 1-". The 
high bit of the last output code -"0201 " is made cl e ar to indicat e" indicates that this is the end of 
the repeat count information in this field. 

[0053L Using the repeat count multiple output codes times, the strung -together codes for the 
entire repeat count would, in the above example be ^100102001 1020010201^" Thus, the 
strung together multiple bytes of output codes provide a full representation of the repeat count. In 
practice, five output codes may be used to represent up to four billion characters. 
Notwithstanding the number of bits in the output codes, the high bit is used to represent the 
count. Accordingly, whatever output code size is used, full advantage is taken of all available 
bits for the repeat count. 

It is perhaps worthwhile emphasizing here that conventiona l [00541 Conventional LZW 
techniques lack the ability to scan ahead. Conventional PB techniques, on the other hand, scan 
ahead to locate matches with whatever character has been read and must fully generate the match 
coding for each matching sequence. In contrast to both , the present invention scans ahead to 
locate matches with only selective characters, preferably only white and black, respectively 
represented herein by ^"0-" and ^"1-". Furthennore, the pre s ent invention use scan use a 
predefined code for each of the selected characters, e.g. white and blac k. Hence , and hence the- 
match coding for each matching sequence need only be partially generated^ since the-predefined 
codes, e.g. codes 100 or 101, which identifies th e applicabl e sequ e nc e a s a se quence of white or 
black characters i sare pre-generated and need only be read from the code dictionary 3000. 1300. 
Accordingly, the present invention is capable of providing superior encoding o f, for example, 
large images using less computing resources and computing time. 

[00551 As noted above, once the RIP processor 2050a determines it is at the last -"0-" in the 
sequence, i.e. by determining from fee-scanning ahead on a character by character ba s is that athe 
next character does not match witha ^ ^"0." the scanning ahead is discontinued and normal 
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processing continues resumes . Thus, the RIP processor 2050a continues by reading the -"1 
immediately following the last -"0-" in the sequence. The processor 2050a determines if a code 
exists for the character sequence ""01-". Since, in this example, no such code exist at this point 
in the processing, a new code 108 is generated to represent the new pattern string -"01 V and the 
new code is added to the dictionary 1300. The processor 2050a proceeds by reading the ""c-" 
immediately following the -"1 "" in the sequence. The processor determines if a code exists for 
the character sequence n "lc-". Since, in this example, no such code exists at this point in the 
processing, a new code 109 is generated to represent the new pattern string ""lc V and the new 
code is added to the dictionary 1300. 

[00561_ The processor ferthe r2050a then proceeds by reading the ""0"" immediately following 
the second ""c-" in the sequence. The processor 2050a determines if a code exists for the 
character sequence ""c0"". In this example, such a code, i.e. code 107, does exist. The RIP 
processor 2050a als ethen scans ahead to determine if another -"0-" immediately follows this 
occurrence of-"c0"". Since, in this case the RIP processor 2050a determination n ext character is 
negativ e . not a "0," the scanning ahead is discontinued and normal processing continues resumes . 

[00571_ The processor 2050a-new proceeds by reading the ""1 "" immediately following the 
second -"c0"" in the sequence. The processor 2050a determines if a code exists for the character 
sequence ""cOl-". Since, in this example, such a code does not exist, a new code 1 10 is 
generated to represent the new pattern string -"cOl " whic h " and the new code is added to the 
code dictionary 1300. Because the "1" is the last character, the combination of the last generated 
code and the last character can be represented as ""1071"". The RIP processor 2050a also scans 
ahead to determine if another -"1"" immediately follows this occurrence of ""cOl"". Since, in 
this case the RIP processor 2050a dotormination n ext character is n e gativ e . not a "1." the scanning 
ahead is discontinued and normal . Normal processing would continu e resume if further 
characters remained to be encoded. However, since the-^'cOl " " and 1 are the final characters, 
encoding ends. 
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[00581 _ The processor_2050a ultimately generates encoded output data forming a of the form 
" 1 02abc 1 00 1 0200 1 1 0200 1 020 1 1 07 1 1 03 " The sequence including the includes the encoded 
string of c hnr nc tar r, "102fthc1 001 02001 1020010201 107 11 03". and an end code, code 103. 

[00591„ Similar to the LZW techniques, in the AGFA technique^ the RIP processor 2050a builds a 
tree of numerous codes generated using a combination of p re-defined or other cod e s and 
generated codes. The AGFA technique is thereby is-capable of providing satisfactory results even 
though the processing is performed on a byte by byte basis to find repeating bytes. How e ver, as 
compare d Compared to LZW technique s , in the AGFA LZW technique, the AGFA technique 
requires substantially reduced p rocessing time and resources required by the RIP 2050 to encode 
large sequences is substantially reduc e d through the use o f because it uses special pre-defined 
codes. The decoding processor, i.e. the imager controller processor 2100 a, which could serve a s 
a printer controller (not shown) or be some other type decoding device, builds a similar tree using 
the codes in the code dictionary received from the encoding processor, i.e. the RIP processor 
2050a. Basicall y Aside from the imager controller processor 2100a . the decoding processor 
could serve as a printer controller (not shown) or be some other type of decoding device. The 
decoding processor performs the reciprocal of the encoding process to decode the encoded 
sequence characters ^102abcl001 02001 1020010201 1071 103*". It should be understood that 
the encoded data could if desired be transmitted to the decoding device via a direct 
communications link, a local network, a public network such as the Internet, or some other type 
of network. Further, such communications may be by wire communications or wireless 
communications. It will also be recognized by those skilled in the art that, while the invention 
has been described above in terms of one or more preferred embodiments, it is not limited 
thereto. Various features and aspects of the above described invention may be used individually 
or jointly. Furthe rmore , although the invention has been described in the context of its 
implementation in a particular environment and for particular purposes, e.g. imaging, those 
skilled in the art will recognize that its usefulness is not limited thereto and that the present 
invention ea nmay be beneficially utilized in any number of environments and implementations. 
Accordingly, the claims set forth below should be construed in view of the full breadth and spirit 
of the invention as disclosed herein. 
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